Accelerating Reinforcement Learning by Composing Solutions of Automatically Identified Subtasks

Author

  • Chris Drummond
Abstract

This paper discusses a system that accelerates reinforcement learning by using transfer from related tasks. Without such transfer, even if two tasks are very similar at some abstract level, an extensive re-learning effort is required. The system achieves much of its power by transferring parts of previously learned solutions rather than a single complete solution. The system exploits strong features in the multi-dimensional function produced by reinforcement learning in solving a particular task. These features are stable and easy to recognize early in the learning process. They generate a partitioning of the state space and thus the function. The partition is represented as a graph. This is used to index and compose functions stored in a case base to form a close approximation to the solution of the new task. Experiments demonstrate that function composition often produces more than an order of magnitude increase in learning rate compared to a basic reinforcement learning algorithm.
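The partitioning step described in the abstract can be illustrated with a toy sketch. It assumes a value function tabulated on a small 2-D grid and treats any jump between neighbouring cells larger than a hypothetical `threshold` as a "strong feature". The threshold-based flood fill, the function names, and the toy data are all illustrative assumptions, not the paper's feature extractor, and case-base retrieval and composition are not shown.

```python
from collections import deque


def partition_by_discontinuity(values, threshold):
    """Label grid cells so that neighbours whose values differ by more than
    `threshold` are never merged directly (flood fill over smooth steps).
    This is a simple stand-in for feature detection, assumed for illustration."""
    rows, cols = len(values), len(values[0])
    labels = [[-1] * cols for _ in range(rows)]
    next_label = 0
    for r0 in range(rows):
        for c0 in range(cols):
            if labels[r0][c0] != -1:
                continue
            labels[r0][c0] = next_label
            queue = deque([(r0, c0)])
            while queue:
                r, c = queue.popleft()
                for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
                    if (0 <= nr < rows and 0 <= nc < cols
                            and labels[nr][nc] == -1
                            and abs(values[nr][nc] - values[r][c]) <= threshold):
                        labels[nr][nc] = next_label
                        queue.append((nr, nc))
            next_label += 1
    return labels


def region_graph(labels):
    """Return the set of undirected edges between regions that touch."""
    rows, cols = len(labels), len(labels[0])
    edges = set()
    for r in range(rows):
        for c in range(cols):
            for nr, nc in ((r + 1, c), (r, c + 1)):
                if nr < rows and nc < cols and labels[nr][nc] != labels[r][c]:
                    edges.add(tuple(sorted((labels[r][c], labels[nr][nc]))))
    return edges


# Toy value function (made-up numbers): two smooth areas split by a sharp step.
toy_values = [
    [0.1, 0.2, 0.3, 0.9, 1.0],
    [0.1, 0.2, 0.3, 0.9, 1.0],
    [0.1, 0.2, 0.3, 0.9, 1.0],
]
toy_labels = partition_by_discontinuity(toy_values, threshold=0.3)
toy_graph = region_graph(toy_labels)
```

In this sketch the resulting graph is the kind of object that could serve as an index into stored sub-solutions, one per region. How the paper actually matches graphs and composes the stored functions is not specified in the abstract.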

Similar resources

Accelerating Reinforcement Learning by Composing Solutions of Automatically Identified Subtasks

This paper discusses a system that accelerates reinforcement learning by using transfer from related tasks. Without such transfer, even if two tasks are very similar at some abstract level, an extensive re-learning effort is required. The system achieves much of its power by transferring parts of previously learned...

Ibots Learn Genuine Team Solutions

"Ibots" (Integrating roBOTS) is a computer experiment in group learning. It is designed to understand how to use reinforcement learning to program automatically a team of robots with a shared mission. Moreover, we are interested in deriving genuine team solutions. These are policies whose form strongly depends on the number of robots composing the team, on their individual skills and weaknesses...

Safe State Abstraction and Reusable Continuing Subtasks in Hierarchical Reinforcement Learning

Hierarchical reinforcement learning methods have not been able to simultaneously abstract and reuse subtasks with discounted value functions. The contribution of this paper is to introduce two completion functions that jointly decompose the value function hierarchically to solve this problem. The significance of this result is that the benefits of hierarchical reinforcement learning can be exte...

Ibots: Learning Real Team Solutions

This paper presents "Ibots" (Integrating roBOTS), a computer experiment in team robotics designed on an artificial mission. Our aim is to understand how to use reinforcement learning to program automatically a team of robots with a shared mission. Moreover, we are interested in learning real team solutions. These are programs whose form strongly depends on the number of robots composing the team...

Learning Real Team Solutions

This paper presents "Ibots" (Integrating roBOTS), a computer experiment in group learning designed on an artificial mission. By this experiment, our aim is to understand how to use reinforcement learning to program automatically a team of robots with a shared mission. Moreover, we are interested in learning real team solutions. These are programs whose form strongly depends on the number of robo...

Journal title:
  • J. Artif. Intell. Res.

Volume 16

Publication date: 2002